8351500: G1: NUMA migrations cause crashes in region allocation #3607
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This is not a clean backport. The effected G1Allocator and G1Collector methods have changed since JDK17.
So this backports reimplements the patch in a minimally invasive way while retaining as much similarity as possible with the original patch.
The gist of the patch is clear: instead of finding out the NUMA node index at every instance of G1Allocator::allocate_xxx, and then be subject to NUMA node migrations, we fix the NUMA node index once and use that one.
I tested this patch with my "FakeNUMA" addition (I plan to upstream that one at some point). This FakeNUMA mode mimics a lot of NUMA node migrations. I can verify that without this patch the JVM crashes quickly, with the patch it does not crash.
Progress
Issue
Reviewing
Using
git
Checkout this PR locally:
$ git fetch https://git.openjdk.org/jdk17u-dev.git pull/3607/head:pull/3607
$ git checkout pull/3607
Update a local copy of the PR:
$ git checkout pull/3607
$ git pull https://git.openjdk.org/jdk17u-dev.git pull/3607/head
Using Skara CLI tools
Checkout this PR locally:
$ git pr checkout 3607
View PR using the GUI difftool:
$ git pr show -t 3607
Using diff file
Download this PR as a diff file:
https://git.openjdk.org/jdk17u-dev/pull/3607.diff
Using Webrev
Link to Webrev Comment